NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Think While You Generate: Discrete Diffusion with Planned Denoising

Liu, Sulin; Nam, Juno; Campbell, Andrew; Stärk, Hannes; Xu, Yilun; Jaakkola, Tommi; Gómez-Bombarelli, Rafael (April 2025, ICLR 2025)

Discrete diffusion has achieved state-of-the-art performance, outperforming or approaching autoregressive models on standard benchmarks. In this work, we introduce Discrete Diffusion with Planned Denoising (DDPD), a novel framework that separates the generation process into two models: a planner and a denoiser. At inference time, the planner selects which positions to denoise next by identifying the most corrupted positions in need of denoising, including both initially corrupted and those requiring additional refinement. This plan-and-denoise approach enables more efficient reconstruction during generation by iteratively identifying and denoising corruptions in the optimal order. DDPD outperforms traditional denoiser-only mask diffusion methods, achieving superior results on language modeling benchmarks such as text8, OpenWebText, and token-based image generation on ImageNet 256×256. Notably, in language modeling, DDPD significantly reduces the performance gap between diffusion-based and autoregressive methods in terms of generative perplexity.
more » « less
Free, publicly-accessible full text available April 24, 2026
Think While You Generate: Discrete Diffusion with Planned Denoising

Liu, Sulin; Nam, Juno; Campbell, Andrew; Stärk, Hannes; Xu, Yilun; Jaakkola, Tommi; Gómez-Bombarelli, Rafael (October 2024, ArXiv)

Discrete diffusion has achieved state-of-the-art performance, outperforming or approaching autoregressive models on standard benchmarks. In this work, we introduce Discrete Diffusion with Planned Denoising (DDPD), a novel framework that separates the generation process into two models: a planner and a denoiser. At inference time, the planner selects which positions to denoise next by identifying the most corrupted positions in need of denoising, including both initially corrupted and those requiring additional refinement. This plan-and-denoise approach enables more efficient reconstruction during generation by iteratively identifying and denoising corruptions in the optimal order. DDPD outperforms traditional denoiser-only mask diffusion methods, achieving superior results on language modeling benchmarks such as text8, OpenWebText, and token-based image generation on ImageNet 256×256. Notably, in language modeling, DDPD significantly reduces the performance gap between diffusion-based and autoregressive methods in terms of generative perplexity.
more » « less
Full Text Available
Discovering Relationships between OSDAs and Zeolites through Data Mining and Generative Neural Networks

https://doi.org/10.1021/acscentsci.1c00024

Jensen, Zach; Kwon, Soonhyoung; Schwalbe-Koda, Daniel; Paris, Cecilia; Gómez-Bombarelli, Rafael; Román-Leshkov, Yuriy; Corma, Avelino; Moliner, Manuel; Olivetti, Elsa A. (April 2021, ACS Central Science)

Search for: All records